The battle for user-friendly bioinformatics

نویسنده

  • David Roy Smith
چکیده

My first experience of doing scientific research came in the third year of my undergrad when genetics professor Marty Snyder kindly gave me a summer job in her lab at Acadia University. On my first day at work, Marty handed me a CD labeled “scallop data,” and then directed me to a towering Power Mac G4 computer. “See this?” she said, slapping the top of the G4. “Over the next few months you are going to become very well acquainted with this beast.” My assignment was to assemble a segment of the sea scallop genome. To do this, I needed to use a software package called AutoAssembler. “Are you good with computers?” Marty asked. Not really, but I answered yes. “Great,” she said, on her way out the door—“Just insert the CD and click the double-helix icon.” And so began my odyssey into the world of bioinformatics. AutoAssembler was easy to use, and in no time I was digitally piecing together chunks of scallop DNA. The software had an intuitive, graphical user interface (GUI), which allowed me to drag-anddrop and point-and-click my way to scientific success. For someone who had never done research before, it was exhilarating to see hundreds of DNA sequences and their corresponding chromatograms, like long mountain ranges, spread across the screen. Then poof, with the push of a button, I could transform these genetic puzzle pieces into full-length genes. The experience also gave me the courage to explore other bioinformatics resources online. Before the week was out, I was blasting this, aligning that, and bootstrapping it all together. I was fast becoming a genomic junky, so much so that I asked Marty if I could have a copy of AutoAssembler to use on my laptop computer at home. The answer was no, of course. “Commercial bioinformatics software packages, like AutoAssembler,” Marty explained, “are very expensive and, unfortunately, the lab can only afford one license.” Not to worry, I thought. I’ll just download one of the many open-source genome assemblers that are available online. I soon discovered, however, that most of them, although powerful, are command-line driven, can take weeks to learn, and provide little in the way of instruction or technical support. After a few failed attempts at using some of these programs, I scurried back to AutoAssembler with my technological-tail between my legs. Years later, I found myself on the other side of the country working in a bioinformatics-focused lab where all around me was the buzz of RAM’ed up computers and Linux operating systems, and even the coffee machines seemed like they were command-line driven. In this environment, drag and drop was for amateurs and GUI was a dirty word. But late at night, in the privacy of my one-bedroom apartment, I would covertly run my favorite user-friendly bioinformatics tools. I had CodonCode Aligner for assembling Sanger data, a student license of Geneious for genome annotation and alignments, MEGA for basic phylogenies, and an academic copy of CLC Workbench for next-generation sequence analysis. These programs were more than adequate for addressing most of my bioinformatics needs and were certainly more enjoyable to use than the Unix workstations and barebones programs in the lab. Nevertheless, I did understand why the lab avoided the types of GUI software that I was so fond of: they can be costly, memory-hungry, slow, poor at handling massive datasets, and, because of their complex underlying code, difficult to customize or modify. There is also a lot to be said for mastering the use and theory of the open-source programs upon which the commercial tools are based. Over time, I discovered that I wasn’t the only one in the lab with a penchant for the point and click. Although reluctant to admit to it, my colleagues were impressed by many of the cutting-edge commercial bioinformatics platforms hitting the market, which, unlike their predecessors, were fast, powerful, beautifully designed, and provided wide-ranging functionality. Similar to the operating systems on smartphones, contemporary bioinformatics software suites are multi-faceted, allowing users to download applications (or “plugins”) for specific types of analyses, and integrate both open-source as well as proprietary algorithms, making the software flexible and scalable to users’ needs. They also provide an excellent way to organize and access molecular sequence data, and support the import and export of dozens of different file formats. But as one of my lab mates said: “Why should I pay hundreds of dollars for a prettied-up, allin-one package of programs that I can get for free?” That same person, however, did not think twice about forking out the big bucks on Adobe Photoshop for making publication-quality images. Free software or not, it seemed like everyone in the department, from ecologists to population geneticists to cell biologists, was dealing with bioinformatics issues. Each day, researchers were stopping by the lab to ask my computerwhiz colleagues for advice. Most had used next-generation sequencing technologies to complement their studies and were looking for straightforward ways to analyze their data. Some had very specific but complex questions, such as, “How do I set up a pipeline for genome assembly and annotation?” Whereas others would ask: “I just received a 5 GB fastq file of Illumina RNA-seq data, what do I do next?” For the

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of User-Friendly Texts vs. Impersonal and Hybrid Texts on the Reading Comprehension Ability of Iranian EFL Learners

     This study focuses on the effect of user-friendly, impersonal, and hybrid texts on the reading comprehension ability of Iranian foreign language learners. Forty-five students of AlzahraUniversity were selected on the basis of their performance in a recent TOEFL. They were given three different texts (each group of 15 students was given one type) describing the same area of English usage, w...

متن کامل

wEMBOSS: a web interface for EMBOSS

UNLABELLED wEMBOSS provides a web environment from which the user can access EMBOSS in a user-friendly way. wEMBOSS supplies each user with space and tools to organize and review his or her work. AVAILABILITY wEMBOSS can be downloaded at http://www.wemboss.org CONTACT [email protected].

متن کامل

lga972: a cross-platform application for optimizing LD studies using a genetic algorithm

lga972 is a user-friendly cross-platform application with a graphical interface for determining the design features of two-stage genetic linkage disequilibrium studies that minimize the genotyping burden.

متن کامل

A free user friendly program for evaluation of radiotherapy plans based on different dose response models

Introduction: Radiotherapy (RT) plan evaluation using dose response models has become a feasible approach in routine clinical practice. Although there are several tools for this task, they suffer from limitations including number of different dose response models and parameters. In the present study, we aimed to develop a free program for RT plan evaluation based on a variety ...

متن کامل

SubLoc: a server/client suite for protein subcellular location based on SOAP

Based on SOAP(Simple Object Access Protocol) technology, the SubLoc server/client suite offers a user-friendly interface for searching and predicting protein subcellular location.

متن کامل

MollDE: a homology modeling framework you can click with

UNLABELLED Molecular Integrated Development Environment (MolIDE) is an integrated application designed to provide homology modeling tools and protocols under a uniform, user-friendly graphical interface. Its main purpose is to combine the most frequent modeling steps in a semi-automatic, interactive way, guiding the user from the target protein sequence to the final three-dimensional protein st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2013